Reverse-Engineering Visualizations: Recovering Visual Encodings from Chart Images

نویسندگان

  • Jorge Poco
  • Jeffrey Heer
چکیده

We investigate how to automatically recover visual encodings from a chart image, primarily using inferred text elements. We contribute an end-to-end pipeline which takes a bitmap image as input and returns a visual encoding specification as output. We present a text analysis pipeline which detects text elements in a chart, classifies their role (e.g., chart title, x-axis label, y-axis title, etc.), and recovers the text content using optical character recognition. We also train a Convolutional Neural Network for mark type classification. Using the identified text elements and graphical mark type, we can then infer the encoding specification of an input chart image. We evaluate our techniques on three chart corpora: a set of automatically labeled charts generated using Vega, charts from the Quartz news website, and charts extracted from academic papers. We demonstrate accurate automatic inference of text elements, mark types, and chart specifications across a variety of input chart types.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Visual encodings that support physical navigation on large displays

Visual encodings are the medium through which information is displayed, perceived, interpreted, and finally transferred from a visualization to the user. Traditionally, such encodings display information as representations of length, color, size, slope, position, and other glyphs. Guidelines for such encodings have been proposed, but they generally assume a small display, small datasets, and a ...

متن کامل

Demonstration of the Softvision Software Visualization Framework

When recovering and maintaining the architectures of software systems, effective visual presentations are essential for grasping the concepts and constructs involved. There exist several reverse engineering tools that provide visualizations of architectural information. The tools usually offer a set of views allowing interactive navigation of the architectural information in some predefined way...

متن کامل

Understanding Data-Driven Visual Encodings through Deconstruction

Our ongoing work aims to better understand the relationship between visual marks and data encodings that comprise innovative data visualizations. By innovative, we refer to data visualizations that in part extend or deviate from conventional visualization techniques. Authors of these types of visualizations engage and inform their audience by incorporating novel data-bound compositions. In our ...

متن کامل

An Open Toolkit for Prototyping Reverse Engineering Visualizations

Maintenance and evolution of complex software systems (such as mobile telephones) involves activities such as reverse engineering (RE) and software visualization. Although several RE tools exist, we found their architecture hard to adapt to the domain and problem specific requirements posed by our current practice in Nokia. In this paper, we present an open architecture which allows easy protot...

متن کامل

Graphical Perception in Animated Bar Charts

Interactive visual applications create animations that encode changes in the data. For example, cross-filtering dynamically updates linked visualizations based on the user’s continuous brushing actions. The animated effects resulting from these interactions depends both on how interaction (e.g., brushing speed) controls properties of the animation such as frame rate, as well as how the data tha...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Comput. Graph. Forum

دوره 36  شماره 

صفحات  -

تاریخ انتشار 2017